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Bone-related cadherin-lrke protein and process for its production. 
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A bone-related protein named OSF-4 which is obtainec from bone tissue of a mammal including mouse or 
human, and a process for its production. This protein is a novel naturally occurring mammal protein of the 
cadherin family. 

OSF-4 acts as an adhesion molecule or a growth factor which takes part in the process of osteogenesis at 
the site of bone induction. OSF-4 can be used as an agent for treating bone metabolic diseases, and its high 
organ specificity for bones enables its use as a diagnostic reagent for bone metabolic diseases. 
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This invention provides a novel bone-.eiated protein. It is named OSF-4 and belongs to a group of 
cadhcrin molecules. The OSF-4 can be obtained from bone tissue of a mammal including mouse or human. 
This bone-related protein is useful for the diagnosis and treatment of bone metabolic diseases. 

Bone metabolic diseases include osteoporosis, Pagefs disease, osteomalacia, hyperostosis, and 
osteopetrosis Osteoporosis, in particular, has a high incidence enough to affect about more than a half of 
postmenopausal women and elderly people, and effective methods for its diagnosis and treatment have 
been strongly desired. 

Bone metabolic diseases involve some disorder of bone metabolism at the cellular level in bone tissue. 
The discovery, isolation and identification of factors associatr)d specifically with bone metabolism are very 
effective for elucidating this disorder. 

A cell line of osteoblasts which play a maior role in osteogenesis, has been used and a proteinaceous 
fartor produced specifically by this cell line has been identified. Therefore, the present invention provides a 
novel protein named OSF-4 which is substantially bone-specific, and which is highly homologous with 
vanous known cadherin type cell adhesion molecules in terms of amino acid sequence. 

The OSF-4 can also be produced from the DNA sequence described in the present specification by an 
ordinary genetic engineenng technique known in the art. Furthermore, the OSF 4 or its fragment can be 
produced from the amino acid sequence described in the specification by a chemical peptide synthesis 
method. Moieover, that fragment of the DNA sequence of the OSF-4 described in the present invention 
which is specifically different from other cadherin molecules can be synthesized with a length of 15 to 50 
bases by an ordinary chemical oligonucleotide synthesis method. That fragmentary sequence can be used 
as a DNA probe for finding and identifying bone-denved cells. This identification of bone-derived cells is 
useful particularly for grasping the origin of metastatic or recurrent carcinoma, thus leading to an 
appropnate therapy for recurrent cancer. Of the partial peptides of the OSF-4. the peptide in the epitope 
portion that can be recognized by antibodies is usable for preparing a monoclonal antibody specific for the 
OSF-4. The resulting monoclonal antibody is of marked value for identifying bone-derived cells by an 
immunological ceil tissue staining method. Because of its similarity to cell adhesion molecules, the OSF-4 is 
also useful for the treatment of fracture 

OSF-4 is a bone-specific cadhenn-like proteinaceous factor. The following is known about cadherin 
which is a cell adhesion molecule involved in morphogenesis 

The segmentation of cell population is one of the most basic elements for the construction of an animal 
body This segmentation begins at a very early stage of morphogenesis. As the differentiation of cells 
proceeds, the same types of cells migrate and become reorganized in an orderly manner, thereby 
performing morphogenesis as well as the construction and maintenance of tissues. One of the elements that 
control such cellular migration is the selective adhesion of cells Cells have the features of recognizing 
adjacent cells or adjacent extracellular matrices, and adhering to only particular ones. In accordance with 
the differentiation of cells, their adhesion specificities vary. Consequently, these cells may leave particular 
sites migrating to and gathering at the sites where they should have originally been situated. So far. 
numerous cell adhesion molecules have been identified and all show cell type specificities. These 
molecules can be roughly classified into more than 4 groups, i.e. cadherin family, immunoglobulin 
superfamily adhesion molecules, integrin superfamily. selectins, and those not belonging to these cate- 
gories (Hynes et al., (1992) Cell, vol. 68, pp. 303-322) 

Cadherins, in particular, have been well analyzed, and not only their structures, but their functions have 
also been extensively studied. Cadherins are glycoproteins with molecular weights of about 120 kD, and are 
Ca2*-dependent intercellular adhesion molecules (Takeichi, (1991), Science, vol, 251, pp. 1451-1455). 
Nearly 10 types of them have been identified. The respective types have binding specificities, and the 
same molecules react homophilically with each other. As a result, cells having the same type of cadherin 
bind selectively to each other. Thus, cadherins are considered indispensable for determining the specific- 
ities of intercellular adhesion. Typical examples are E-cadherin (epithelial cadhenn), P-cadherin (placental 
cadherin), N-cadherin (neural cadherin). and L-CAM (liver cell adhesion molecule). All of them are similar in 
structure; each is composed of an extracellular region comprising 4 to 5 repeats of about 110 amino acids, 
a transmembrane region, and a cytoplasmic region comprising about 150 amino acids. Comparisons among 
the respective subclasses have shown about 50% identity of the ammo acids. In the extracellular region, the 
Identity rate is higher at a site nearer the N-terminus; it becomes gradually lower the nearer the 
transmembrane region; and it is maximal in the cytoplasmic region. In recent years, new cadhenns have 
; been reported, including, for instance, M-cadherin (muscle cadherin). B-cadherin (brain cadfierin). T- 
cadhenn (truncated cadherin), and desmoglein localized in desmosomes. These cadherins may originate 
from a single ancestor gene, and constitute the cadherin family. 
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stua:.:-£ >jf a /aonunn rroioculc fiavo sncwn \^^^\ :ites tMriciing t-j ca^rium -jr^s ar^^J 
ao^:■f:r^;rllng bmoing sp^ocifiotv are present in its N-termmal leryrx' The cytopiasr^iic dC'main r.as been fourvi 
te funetionaiiy important for tHe adhesion property, and tj b^nd to a protein suoh as eatenin or actin. 
Through these furictions. eadhenns have been suggested to contribute to cytopiasmio signanng. During 
ifivasen ana metastasis of cancer cells, these eadhenns uncen^c quantitative changes, and so their 
relat.onsnip witfi oncogenesis has attracted a broac attemion Despite many such reports of cadherm 
mclccules, there have been no reports of cadhenn moie^culr^: with specificity for osteoblasts. 

Bone formiatic-n and ma;ntenance are dependent on the nalance between osteoblasts which forrTi bones 
and ■:.steoclas:s whicti resorb bones Osteoblasts are mesefKhyn^ai cells of the same ongm as myobiasrs 
and .ijpocytes while .jsteociasts originate from stem cells as do noLJtfOphils and macrophages, (Jstceclasrs 
are rnown to exoress vitronectin receptors wtiich belong to the [rUegnn family With osteoblasts ilie 
pros ( -nee of cell a:]hesion molecules has only bc^en suggested 

The Object of the present invennon is to fmd a new type of cell aahesfor factor wh ch is specifically 
evpressed in c-steoblasts. Such a new cell adhesion factor is an irTiportant molecule for the proliferation. 
G'fferentiation, rTiigration and reorganization of osteoblasts. ThiS substance can be expected to find use m 
the diagnosis and treatment of various bone metabolic diseases. 

cDNA of mouse OSF-4 {mOSF-4) was isolated from a mouse osteoblastic cell line MC3T3-E1 cDNA 
library constructed by a combination of PGR (polymerase chain reaction) and the subtraction method, and 
by differential screening. Then, the mouse OSF-4 cDNA was used as a probe for screening cDNA libraries 
obtained from hunnan osteosarcoma cells, As a result, two types of OSF-4, named hOSF-4-1 and hOSF-4-2, 
were obtained, and their nucleotide sequences were determined. The nucleotide sequence of the OSF-4 is 
very well conserved between mouse and human. Comparisons between mOSF-4 and hOSF-4-1 as well as 
hOSF-4-2 show 97,1% and 96.4% identity in ammo acid level, respectively (Tables 1 to 3). The very high 
conservation between these species suggests that OSF-4 has essent.al roles in vertebrate bone metabo- 
lism. OSF-4 can be isolated and punfied from the bone e^racts of other vertebrates. The OSF-4 of other 
animal species can be obtained from cDNA libraries or genomic libraries constructed from their bones, 
cultured bone cells and other body tissues by recombinant gene technology using the cDNA of the present 
invention or its DNA fragment as a probe. Search through tf^e currently available DNA and ammo acid 
sequence data bases demonstrated the sequence of the cDNA m the present invention to be novel. 

In companson with the amino acid sequence of mOSF-4, hOSF- 4-1 shows high conservation in ttie 
whole domain. hOSF-4-2. on the other hand, is completely identical with hOSF-4-1 in terms of the N- 
terminal to 631 st amino acid residues. Because of the insertion of 179 bases into the transmembrane 
region, however, frameshift occurs, with the result that the 62 ammo acid residues ranging from the 632nd 
amino acid residue to the 693rd-position C-termmus assume a completely different structure (Tables 1 to 
35 3). Hence, the C-terminal 9 ammo acid residues in the transmembrane region, and the cytoplasmic region 
are completely different between hOSF-4-1 and hOSF-4-2. Such cadherm with the C-terminal region deleted 
corresponds to T-cadherin. This type of cadherm has been suggested to take part m the control of cell 
adhesion. 

A peptide corresponding to 15 hydrophilic ammo acid residues at the 101st to 1i5th positions in the 

40 ECl domain of mOSF-4 was chemically synthesized. This peptide was conjugated with KLH (keyhole limpet 
hemacyanin), and used for immunization of rabbits. The resulting anti-mOSF-4 peptide antisera were used 
for immunohistochemical detection of OSF-4 in systemic slices of the neonatal mouse. OSF-4 was detected 
in the osteoblasts, chondrocytes and so on. 

Generally, the OSF-4 can be directly e^tracted from bone tissue or cartilage tissue of a human, bovine, 

45 murine or other source by a known biochemical technique. 

The DNA coding for the OSF-4 can be obtained by constructing a cDNA library or a genomic library from 
mRNA extracted from vertebrate bone tissue, and using a probe comprising a labeled fragment of the 
mouse DNA sequence disclosed in the present specification A full length cDNA clone can be obtained by a 
combination of the above-described and other standard techniques on molecular biology, 

50 As described above, OSF-4 shows homology with known representative cadherm molecules, but it is a 

cadherm molecule belonging to a new subclass different from those so far reported Its structure is 
composed of 5 repeats m an extracellular region, a transmembrane region, and a cytoplasmic region (Fig, 
1). Comparisons between OSF-4 and other eadhenns have shown that homology m the extracellular region 
becomes lower from the N- terminus toward the transmembrane region and the highest homology is noted 

55 in the cytoplasmic region, according to the homology pattern among the existing different cadhenn 
molecules (Table 4). 
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EC1 


EC2 


EC3 


EC4 


EC5 


TM 


CP 


MP 


0SF-4:N 


39.3 


46.8 


32.8 


29.8 


27.4 


59.4 


50.3 


39.4 


0SF-4:E 


32.7 


40.4 


30.7 


30.8 


17.2 


34.4 


47.1 


33.9 


0SF-4:P 


34.6 


35.8 


36.0 


29.1 


19.0 


23.1 


43.8 


33.3 


0SF-4:M 


31.8 


38.5 


27.6 


26,0 


25.4 


34.4 


45.1 


33.2 


N:E 


60.2 


53.5 


45.1 


43.9 


27.0 


46.9 


64.1 


49.8 


N:P 


51.9 


51.8 


45.1 


49.5 


27.9 


30.8 


57.7 


47,3 


E:P 


65.7 


61.1 


52.7 


46.7 


38.9 


53.8 


79.7 


58.7 



In Table 4 the homology of ammo acids in each region was calculated and expressed in %, The 
abbreviations are as follows; EC1 to ECS, five extracellular regions; TM, transmembrane region; CP, 
cytoplasmic region; MP, mature protein; N, N-cadherin; E, E-cadherin; P, P-cadherin; and M, M- cadhenn. 
In the column "Types of cadherin compared." 0SF-4:t^ denotes compansons of the ammo acid sequences 
in the respective regions between OSF-4 and N-cadhenn (the same is true for the other combinations). 

The protein provided by the present invention is a group of glycoproteins, named OSF-4, which belongs 
to a new cadhenn subclass and plays an important role in osteogenesis. More concretely, the human and 
mouse OSF-4 proteins described in this specification are included. OSF-4 is expressed in osteoblasts 
during the process of bone formation, and acts as a cell adhesion molecule and a morphogenesis-related 
substance. These human and mouse OSF-4 proteins can be used to identify and isolate other mammalian 
OSF-4 proteins similar in DNA sequence and amino acid sequence. 

The present invention further provides polypeptides comprising analogues of OSF-4, i.e. mutants and 
fused proteins having OSF-5 adinty, as well as fragments of the OSF-4 which can be identified as OSF-4 
related particularly with at least 10, preferably 15 amino acids. The cDNA of mouse OSF-4 isolated from 
the mouse osteoblastic cell line MC3T3-E1 encodes a protein consisting of 796 amino acids, including a 
signal peptide composed of 24 amino acid residues. There are two isoforms of human OSF-4 which were 
isolated from a human osteosarcoma cDNA library. One cDNA clone, human OSF-4-1, encodes a protein 
consisting of 796 amino acids including a signal peptide composed of 24 ammo acid residues. The other 
cDNA clone, human OSF-4-2, encodes a protein consisting of 693 amino acids including a signal peptide 
composed of 24 amino acid residues. The present invention also provides a process for producing OSF-4 
by recombinant DNA technology. 

According to the present application the term "hybridization under stringent conditions meers hybndiza- 
tion conditions with a salt concentration of 6 x SSC (NaCI-citrate puffer) at 62-68 -C. 

Brief Explanation of Tables and Figures 

Table 1 shows an alignment of the amino acid sequences of mouse OSF-4. human OSF-4-1 and human 
OSF-4-2. Common amino acid residues are shown in the form of consensus. 

Table 2 shows a continuation of an alignment of the amino acid sequences of mouse OSF-4, human OSF-4- 
1 and human OSF-4-2 shown in Table 1. Common amino acid residues are shown in the form of 

consensus. _ _ 

Table 3 shows a continuation of an alignment of the amino acid sequences of mouse OSF-4. human ObF-4- 
1 and human OSF-4-2 shown in Table 2. Common amino acid residues are shown in the form of 

consensus. . ^ 

Figure 1 is a schematic drawing of the structure of mouse OSF-4 precursor protein. OSF-4 precursor 
protein is divided into eight regions, a signal region (shaded part), five extracellular regions (EC1, EC2, ECS, 
EC4 and ECS) a transmembrane region (TM) and a cytoplasmic region (CP). 

Figure 2 shows a restriction enzyme map of cDNA coding for mouse OSF-4. The bold letters indicate the 
region coding for the amino acid of OSF-4. There are no Kpnl and Sail sites in the map. 
Figure 3 shows the tissue-specific expression of mouse OSF-4. This was analyzed by purifying RNA from 
vanous tissue and cultured cells followed by RNA dot blotting. This diagram shows the results of 
autoradiography. 

Figure 4 shows the map of expression vector pMSS60. It is mentioned that the content of the Japanese 
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Tne presort i-ven'ion will be ooscnbed m more dotaii by reference to the foilowmg Exammes: 
Example 1 

C^enstraction of cDNA hlyary by subtraction ar^d PGR 

The construe:. Ofi of a cDNA hbrary specific for the osteoblastic coll iiru: MCara-EI wiM be hereinafter 
described. This cDNA I brary is coristructed from MC3T3-E1 cDNA library by a combination of the 
subtraction methcd and n^e PGR with the geno expressed m rmouse liver tissue being subtracted Each 
cDNA clone has gene fragments with an average length of about 300 bases, and is characterized :n that the 
gene with a low content has been amplified too. 

Unless otherwise specified, all general rocorT^bmant DNA protocols ccmphed witn Sambrook et al., 
"Molecular Glonmg Manual" {1989). Cold Spring Harbor Laboratory. Gofd Spring Harbor, U.S.A. Total RNAs 
were extracted from 8x10^ MG3T3-E1 cells and about 1 g of mouse liver tissue by the guanidme method, 
Pofy A* RNAs were punfiea from the total RNAs by means of the corT^mercraily available product "Oltgo dT 
Latex rmRNA PunficatKDn Kit" (Takara Shuzo), cDNAs were synthesized by a cDNA synthesis kit (Amer- 
sham) using 1 ug of each poly A^RNA as a template. However, a random primer was used, instead of an 
ohgo dT primer, m an amount of 1.5 times its ordinary amount used, whereby the cDNA chain elongation 
was restricted to an average length of about 300 bases After the cDNAs were made double-stranded and 
blunt-ended by use of the above kit, they were joined with T4DNA ligase (Takara Shuzo) to the following 
two DNA linkers, i.e. ATOS-1 2 (Sequence ID Nos. 4 and 5 of the Sequence Listing) for the MC3T3-E1 
cDNA, and ATOS-4,5 (Sequence ID Nos. 6 and 7 of the Sequence Listing) for the liver cDNA: 

ATOS-1/2: 

ATOS-1 5'- CTCTTGCTTGAATTCGGACTA-3' 

ATOS-2 3'-ACACGAGAACGAACTTAAGCCTGAT-5' 

ATOS-4/5: 

ATOS-4 5'- CTCTTGCTTAAGCTTGGACTA-3' 

ATOS-5 3'-ACACGAGAACGAATTCGAACCTGAT-5' 



4Q Then, each reaction product was subjected to DNA amplification by the PGR (polymerase chain 

reaction) method using ATOS-1 and ATOS-4. respectively, as primers. The amplified DNA concentration 
was deterrTiined with the DNA assay kit "DNA Dipstick" (Invitrogen). The subtraction method was performed 
using photobiotin (Pirce). Photobiotin (20 ng) was added to 20 ug of the PCR-amplified liver cDNA. and 
light from a sunlamp 10 cm apart was projected onto the liver cDNA for 10 minutes to label it with biotin. To 

45 3.0 ug of the labeled iiver cDNA was added 0.3 ug of unlabeled MC3T3-E1 cDNA for hybridization. Then, 
streptavidin {Takara Shuzo) was reacted, and the reaction mixture was extracted with phenol to remove 
cDNA common to the liver cDNA from the MC3T3-E1 cDNA. The subtraction method was repeated to 
remove as much of the common cDNA as possible from the MC3T3-E1 cDNA DNA was amplified by PGR 
using the aforementioned ATOS-1, and the DNA concentration was rrieasured. This cDNA (10 ng) was 

5(} digestec with the restriction enzyme EcoRI. and then ligated with T4 ligase to 1 ug of the phage vector 
lambda gt10 (lambda gtiO EcoRI cloning kit. Stratagene) which was digested with EcoRi and de- 
phosphorylated at its ends. The resulting recombinant DNA was pacMged into lambda phage particles by 
use of the in vitro packaging kit "Gigapack-gold" (Stratagene), The recombinant phages were infected into 
E. coll 0600 (preserved as HT003 at Japanese Gancer Research Resources Bank, National Institute of 

55 Health of Japan), and the organisms were applied to an agar medium along with a soft agar medium to form 
phage plaques. The efficiency of infection was determined to be 3x10^ phage plaques ug vector DNA. 

The resulting cDNA library was subjected to differential screening to select clones with a high 
specificity for MG3T3-E1, Goncreteiy. 2.25x10^ phages were applied to total 10 plates, and the resulting 
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plaques on each plalc wco t.anslorred to two nylon membrane filters (total 20 fUters). These ser.es of 
olaque. w.re suhiectcd to plaque hybridization with radiolabeled MC3T3-E1 cDNA as the probe for one of 
Ihe senes'and wi'th radiolabeled Hver cDNA for the other series. In 273 clones, expression was observed 
with the MC3T3-E1 CDNA probe but not with the Hver cDNA probe. These clones were used as a mmi- 
llbrary in subsequent e:<periments 

Example 2 

Isolation of mouse OSF-4 clone 

A description will he made of methods to identify a cDNA fragment of OSF-4 as an MC3T3-E1 specific 
Clone from the mini-library constructed in Example 1, and to clone full length cDNA from the cDNA hbrary 
ot MC3T3-E1 with the use of this fragment, i ,,n 

The total RNAs from MC3T3-E1 and liver prepared in Example 1 were spotted m an amount of 1 ug 
each onto nylon membrane filters, 273 of the filters were prepared, and used for hybridization to be 
described later on. Separately, the DNA of the inserts of the 273 phage clones prepared in Examp e 1 was 
amplified by PCR. This DNA was agarose gel electrophoresed. and mam bands were cut out, punfied, and 
fad' labeled for use as a probe. A clone showing expression with MC3T3-E1 cDNA bu, expression with 
liver CDNA upon autoradiography was recloned into a plasmid vector. Concretely^ the DNA o< the insert^ 
amplified by PCR and then purified was digested with the restriction enzyme EcoRI. and recloned into the 
EcoRI site of the plasmid vector pUCl18 (Takara Shuzo). The DNA sequence of the resulting clone was 
determined with commercially available "DNA Sequence Kit" (Takara Shuzo) using a -^'^^l^^^^^'^ 
search through DNA and protein data bases showed that DNA sequence to constitute a clone '^o^^otogous 
with the existing cadhenn. Th,s clone was designated as D45, and used for subsequent cloning of the full 

""%o' cloning of the full length cDNA, blund-ended double- stranded cDNA was synthesized with the 
CDNA ,synthesis kit "cDNA Synthesis System Plus" (Amersham) using 5 ug of the poly A RNA of MC3T3^ 

.. , . c < Th" ,-,^.,1*;..-- ^r-iMA vv^--^ linatAd tn FroRI/NotI adaptor (Takara Shuzo) using 14 

tl puririea in Exainpit; i. 1 he rrsouiu.iy w„.. ..^ . , vnn kooo 

liqase and the product was agarose gel electrophoresed to purify a fragment more than about 700 base 
X "ong This fragment was joined to the EcoRI site of lambda gt10 phage vector ftratagene , and 
packaged into phage particles in the same way as in Example 1. The packages were infected into E. col, as 
in Exar^ple 1, and the efficiency of infection was determined to be 1.5x10' phage plaques/ug vector DNA^ 
The aforementioned D45 was radiolabeled for use as a probe, and 1 0x10^ phage clones of the cDNA 
library were screened by plaque hybridization. Fourteen positive hybridization signas were obtained^ 
whereafter the Not! fragment of the phage clone with the longest insert was recloned into the Not! site of the 
plasmid vector pGEMI lZf( + ) (Stratagene), The resulting clone was designated as pKOT164. 

Example 3 

Determination of mouse OSF-4 DNA sequence 

Deletion mutants of the pKOT164 and a subclone containing its cDNA fragment were prepared with 
"the Deletion Kit for Kilo Sequence" (Takara Shuzo) by cutting at Intervals of about 300 base pairs in each 
opposite direction. The DNA sequence of each deletion mutant was determined with the automatic DNA 
sequencer 373A (Applied Biosystems, U.S.A), The entire DNA sequence of the cDNA, and an ammo acid 
sequence translated from this DNA sequence are shown as Sequence ID No, 1 of the Sequence Listing, 
The protein encoded by this cDNA was designated as OSF-4. No. 1 of the amino acid residue corresponds 
to the N-terminus of the predicted OSF-4 precursor protein. The restnction enzyme map of that cDNA is 
shown in Fig. 2. 

Example 4 

Tissue specific expression of mouse OSF-4 

RNA dot blotting was performed to Investigate the tissue specific expression of mouse OSF-4. The total 
RNAS of the thymus, spleen, brain, kidney, liver, lung, testis and heart of mice <P^/^^^^^^^^ '^'^f °^ 

Clea) were prepared by the guanidine method, Calvarial osteoblast-rich cells were obtained fror^ a culture 
of newbom mice calvaria. Total RNA was extracted from these cells in the same way as descnbed above. 
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''.'^^o .i'^} of t^^/ :':ia! F^rJA -^r^^ fr-,ni :r^o abo vo-rmentionco nssuus. cultured calvanai cells. M(:3T3-El and 
riiOLiSC- fioroblast ceil lifio rjlH3T3 (ATCC CRL 1658) was dotted onto nyion membrane fiittrrs (E.^ooyne 
PALL) fixed by baking, ane used for hyi; noization . Separately, the pKOT164 was digested Aith TJotl, ara 
irolatea and purified by agarose g»:l electrophoresis. Then, the isolate was radiolabeled and used as a 
;;robe Autoraaiography mdisatea hi.jh eypression ror the cultured calvanai cells and MC3T3-E1. and low 
*:^'pression for the lung and *estis (Fig. 3). 

Evarr;f:!ie 5 

blomng of cDNA coding for human (:)SF-4 

Tlie NotI fraginent contamifig the cDNA region of pKOTt64 was purified and used as a probe to screen 
a human ostocjsarcoma cDNA library ce^nsisting of 1.3>10 clones. Twenty-one positive signals wore 
ootained, and 5 clones were isclated. Two clones with large inserts were recloned into plasmid vector 
;5 fjHSG398. The resulting plasrriids were de:::ignated as pK0Ti61 and piKOTl70. 

t ^amp)le 6 

Determination of human OSF-4 DNA sequence 

20 

Prom the pK0Tl6l and pKOT170 cloned in Example 5 and their subclones, deletion mutants were 
prepared in the same way as m Example 3. Then, their DNA sequences were determined. These DNA 
sequences and the ammo acid sequences predicted from them are shown in Sequence ID Nos. 2 and 3 of 
the Sequence Listing. The proteins encoded by these oDNAs were designated as human OSF-4-1 and 
25 human OSF-4-2. The ammo acid residue No, 1 of each of them corresponds to the N-terminus of the 
predicted OSF-4 precursor protein. 

E ' ample 7 

30 Pioparation of anti-OSF-4 antisera 

In preparing anti-peptide antibodies against mouse OSF-4. the corresponding 15 ammo acid residues in 
the EC1 were synthesized by the solid phase synthesis method us,ng a peptide synthesizer (430A. Applied 
Biosystems), in accordance with an experimental report on M-cadhenn (Donalies et al., (1991), Proc, Natl, 
35 Acad. Sci.. USA., vol. 88, pp 8024-8028). The synthetic peptide was OSF-4. 1 (FVIDDKSGNfHATKT 
Sequence ID No. 8 of the Sequence Listing). This synthetic peptide was conjugated with KLH (keyhole 
hmpet hemacyanin) using glutaraldehyde as a coupling agent, and used for immunization of rabbits. The 
resulting antisera could be used to search immunohistochemically tor the presence ol OSF-4 in newborn 
m::)use systemic slices, and to detect the opression of OSF-4 m E coll. yeast and animal cells 

40 

E. ample 8 

E'pression of OSF-4 in animal cells 

-^5 The present example describes the preparation and e-pression of the e>pression vector for mouse 

OSF-4 m animal cells and the functional analysis of the prodLJced OSF-4. 

There is an open-reading frame m the 5'-fiankmg region of the OSF-4-codmg region m the base 
sequence as shown in SEQ ID NO: 1 m Sequence Listing. The open-reading frame was expected to 
■.lecrease^ the translation efficiency of OSF-4. Therefore, a done which contained the OSF-4-codmg region 

50 al';ne was selected from the delation mutants prepared in Example 3, and it was used tor the preparation of 
•he expression vector for OSF-4. The segment of the clone from G cf the I9lst to A cf the 2700th m SEQ 
ID NO: 1 in Sequence Listing was cut and a linker containing Xhol and BamHI s^tes and a linker contaimr^g 
Xr, s'te were ligated to the 5'-terminus and 3'-terminus of the segment, respectively. Then the segment 
bordered with Xhol site was inserted into the Xhol site of an expression plasmud vector for ammal cells, 

55 pO:N2 (Niwa et al., (1991) Gene. vol. 108, p193-200). The OSF-4 expression vector obtained was termed 
pMSSeO (Fig. 4). 

The pMSSeO was introduced into L-cells of a fibroblas' cell line derived from mouse epidermis by the 
ca elum- phosphoric acid co-precipitating method. Then 12 G418 resistant colonies transformed by pMSSOO 
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wr.„. rulU.,CMl coparateiy to obtain the cloned cell lines. RNA was oxt, acted from these cloned cells Then, 
Th e OSFihfgh producer cloned cell „nes were selected w,th an RNA do. blott,ng method by us.ng 

c. n<;F A rnNA as the orobe The three clones were termed CI . C7 and C1 1 , respectively. 
'^^Tband of ap^ox mately Too .Da react.ng w,th ant,-0SF-4 ant,bodv was detected by Western b,ott,ng 
analv^t*= out of the proteins produced in these cloned cells. 

FuMhermore the functional analysis of OSF-4 produced by these cloned cell l.nes was conducted as 
follows by Takouch.'s aggregation assay method wh,ch had been er,g,nally established to examine cadhenn 
cell adhesive properties (Takeich. et al., (1977) J. Cell Biol , vol 75. p464-4T4)^ ,,p^^,pri 
Fr^t from each monolayer cells (CI. C7 and CM). TC-and TE-troated ceil suspens.ons were prepared. 
T e TC "eafed sus ensions were prepared as fellows. The monolayer cells were rinsed three times 
with CMF solutf n (PucR's Ca- and Mg- free saline: J. E.p, Med. vol. 108 p954- 956, ^^^SKontamin^^ 
mM canum chloride Then the cell suspensions were incubated at 37 'C for 15 mm with HCMF solution 
" EPet Jfen r ine 80 g of NaCI, 0.4 g of KCI, 0.09 g of Na.HPa.7H.0, 1 g of glucose, 2.38 g of 
HEPES and 4 8 mTof lN NaOH in 1,000 m, of H.O , pH7.4) containing 0.01% trypsin and 0.1 mM calcium 

" '"°fhe TE-tieated cell suspensions were prepared by the same procedure as for P-paration of TC-treated 
cell suspensions except incubation at 37 » C for 15 min with HCMF solution containing 1 mM EDTA and 

^'^'ThlreTch cell suspension was rinsed with CMF solution twice and divided into cell suspensions 
.0 contrnmg Tx 1o' cells.3 m, in HCMF solution with or without 1 mM calcium chloride. Each cell suspension 
Ts transferred to a 15 ml-size conical tube and was allowed to make cell aggregation by stirnng with 80 
Tpm at 37 C for one hour. The ratio of N1/N0 was calculated by counting with a Cou ter-counter the 
number o ceHs before stirnng (NO) and the number of cell clots after stirnng (N1). The results are show n 
Table 2 the OSF-4- expressing cloned cell lines showed calcim-dependent cell adhesive property similar to 
->s cad ertn Cadhenn molecule ,s known not to be digested by trypsin with calcium Thus, the cell suspension 
treated with EDTA (TE-treated) were digested by trypsin and did not show any cell adhesion. 
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SECKJENCE LISTIWG 

General Information: 
Appl i cant : 



Title of Invention: 
Uiirixr of Sequences: 
Conputer Readable Form: 
MediLin Type: 
Co<Tputer : 
Operating Systefn: 
Software: 



Hoechst Japan Limited 
New Hoechst Building 
10-16, Akasaka 8-chomc 
Minato'ku, Tokyo 
107 Japan 

TeU (03) 5479-5137 
Fax. (03) 3479-7859 

Bc^-related Ca<*.erin- 1 i ke Protein .r^ Process for its Prod^tion 
8 

3,5" HD Diskette 

386 SX 

MS-DOS 

ASCII 
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"Sfrguence L'St'ng" 
SEQ ID NC: 1 

SEOUEMCE "YPE: nucleic acid 
SEQUENCE LENGTH: 3561 b«s« pai^s 

STRANOEDNESS : dooble 

TOPOLCXiY: nr%ear 

lOLECULAR TYPE: cOMA to wHUA 

ORIGINAL SOURCE 
ORGANIS*<; Mus ffijsculus 

STRAIN: osteoblastic cell tir*MC3T3El 
FEATURE : 

SEQUENCE Description: SEQ ID NO: 1: 



OATTCCCGG CCGCCTTTOA AGACATTCAG TTCTGTTATT TATTGAATCiA CCAATCAGAT 60 
GGGTCCAGCA TGTTATAGOA ATTGCCAGCA GGTATCCAAT CGGTGAAGAA GAAGCTGACT 120 
GCGGAGCTGA CCAACCCTGG CGTGATGTCC TCAGTGACTG AAGATATTCC ATTCCAGAGG ISO 
AGGTCTACTT GACACATCTG GGAGGCCGCC ATCCGAAAGA AAGCCACTCT GTTGGTGTAG 240 
GGACTGACAG CTGCATTCTC CTGTGCCTAC TCCATAACCA AAA ATG AAG GAG AAC 295 

Met Lys Glu Asn 
1 

TAC TGT TTA CAA GCT GCC CTG GTG TGC CTG AGC ATG CTA TAG CAC AGC 
Tyr Cys Leu Gin Ala Ala Leu Vsi Cys Leu Sen Met Leu Tyr His Ser 
5 10 15 20 

CAG GCG TTT GCT CTG GAG CGA CGA AGC CAC CTG CAT CCC TCT TTC CAT 391 
Gin Ala Phe Ala Leu Glu Arg Arg Ser His Leu His Pro Ser Phe His 

25 30 35 

GGA CAC CAT GAG AAG GGC AAG GAG CGG CAG GTG CTG CAA CGC TCC AAG 439 
Gly His His Glu Lys Gly Lys Glu Gly Gin Val Leu Gin Arg Ser Lys 

40 45 50 

AGA GGC TGG GTC TGG AAC CAA TTC TTT GTG ATA GAA GAG TAC ACC GGG 457 
Arg Gly Trp VaL Trp Asn Gin Phe Phe Vat lie Glu Glu Tyr Thr Gly 

55 60 65 

CCT GAC CCT GTG CTG GTG GGC AGG CTT CAT TCT GAG ATT GAC TCC GGT 535 
Pro Asp Pro Vat Leu Val Gly Arg Leu His Ser Asp lie Asp Ser Gly 

70 r> 80 

GAT GGG AAC ATT AAA TAC ATT CTC TCA GGT GAA GGA GCG GGA ACC ATT 583 
Asp Gly Asn Me Lys Tyr He Leu Ser Gly Glu Gly Ala Gly Thr Ue 
85 90 95 100 

TTT GTG ATT GAT GAC AAA TCA GGC AAC ATT CAT GCC ACC AAG ACA TTG 631 
Phe VaL lie Asp Asp Lys Ser Gly Asn lie His Ala Thr Lys Thr Leu 
105 110 115 
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- .e. - M. O.n M. v.. «P 

... .sp T.. ..n .eoa.. Pro Pro Ser Otu P.. U. V.. Ly. 

13S 

V.> on *sp,U ..n «p »sn Pro Pro PSe .isC.uUe T.r 

,65 

v.. ,K.MaS.r *SP.U .sp *spPro ^ T.. CW »sn Se. Lys 

V.L TV. S..U..eu ... =WO.n Pre Tyr P.. S.r V.. O.u M. 

200 

CM AC* 0=T *TC ATC »0C =CC CTT CCC *AT ATO «C ACA <;AA GCC 

,W.-.>eAr, T..A..L«. pre Asnne. A.p Ar, O^o A.. 

,,.C>uO>.TvrHisVa. v.. UeGtnA..LVS Asp N.t Cy Cly «.s 

.30 

25" 255 2" 

265 

2S0 

ASP pro ASP a. C.y C.AsnCW.-V.. THr ,yr Asn Ue V.. 

2,5 

AspC.y AspOlvUeO.. Leo P.e O.u U. T.r m Asp Tyr Thr 
3,0 
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823 



871 



919 
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CAG WT GG-' GTG GTG AAG CTG AAA AAG CCT GTA OAT t t T OAA ACC AAA 1303 

Gl^ Asp G:y *'a'. VaL L ys L Lys L ys Pre Val Asp Phe Clu Thr Lys 

32^ 33C 335 3^0 

AGA GCT T*T AGC TTG AAG ATA GAG GCC CCC AAT GTT CAC ATT GAT CCG 1551 

Arg ALa Ty- Ser Leu Lys ILe GLu Ala Ala Asn Val His He Asp Pro 

3^5 350 355 

AAG TTC A^C AGC AA^ GGA CCT TTC AAC OAC ACT CTG ACC GTC AAG ATT 1399 
Lys Phc :;e Ser Asn Gly Pre Phe Lys Asp Thr Val Thr Val Lys He 

360 365 370 

TCA GTA GAA GAT GCC GAT GAG CCT CCC ATG TTC TTG GCC CCA AGT TAT Ui7 
Ser Val Ciu Asp Ala Asp Glu Pro Pro Met Phe Leu Ala Pro Ser Tyr 

375 3fiO 365 

ATC CAT GAA GTT CAA GAA AAT GCA CCT CCT GCC ACT CTG GTT GGG AGA 1495 
He Mis Glu Val Gin Glu Asn Ala Ala Ala Gly Thr Val Val Gly Arg 

390 395 600 

GTA CAT GCC AAA GAC CCA GAT GCT GCC AAC AGC CCA ATA AGG TAT TCA 1543 
Val His Aia Lys Asp Pro Asp Ala Ala Asn Ser Pro lie Arg Tyr Ser 
^OS 410 415 420 

05 ATT GAT CGT CAT ACT GAC CTC GAC AGG TTT TTC ACG ATT AAT CCA GAA 1591 

lie Asp Arg Mis Thr Asp Leu Asp Arg Phe Phe Thr lie Asn Pro GLu 

425 430 435 

GAT GGT TTT ATT AAA ACT ACG AAA CCT CTA GAT AGC GAA GAA ACT CCC 1639 
:jO *sp Gly Phe ile Lys Thr Thr Lys Pro Leu Asp Arg Glu Glu Thr Ala 

440 445 450 

TGG CTC AAC ATC TCT GTC TTC GCA GCA GAA ATT CAC AAC AGA CAT CAG 1657 
Trp Leu As- JLe Ser Va L Phe Ala Ala Glu Ile His Asn Arg Mis Gin 
35 ^55 460 465 

GAA ACC AAA GTC CCA GTG GCC ATC AGG GTC CTC CAT GTC AAT GAC AAT 1735 
Glu Thr Lys Val Pro Val Ala Ile Arg Val Leu Asp Val Asn Asp Asn 
4 70 4 75 4S0 

^0 GCT CCT AAG TTT CCT CCC CCT TAT GAA CCT TTT ATC TCT GAG ACC GAT 1783 

ALa Pro Lys Phe Ala Ala Pro Tyr Glu Gly Phe Ile Cys CLu Ser Asp 
485 490 495 500 

CAC CCC AAG GCA CTC TCC AAC CAG CCA ATA CTT ACA GTT ACT GCA GAT 1831 
His Pro Lys Ala Leu Ser Asn Gin Pro Ile Val Thr Val Ser Ala Asp 

505 510 515 

GAC CAG GAC GAC ACA GCC AAT GGA CCA AGA TTT ATC TTC AGC CTA CCC 1879 
Asp Gin Asp Asp Thr Ala Asn Gly Pro Arg Phe lie Phe Ser Leu Pro 
520 525 530 
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50 



CCT OAA 

Pro Glu 

AAC ACT 
Asn Thr 
550 
CAG GAC 
Gin Asp 
565 

CCT ATG 
Pro Met 

GTG AAT 
Val Asn 

GCC GGT 
ALs Gly 

ATT CTT 
Me Leu 
630 
AAA GAA 

Lys GLu 
645 

ATA ACC 

He Thr 

G>C ATA 
Asp lie 

CGC AAA 

Arg Lys 

CGA CCA 
Arg Pro 
710 
ATA CAG 

lie Gin 
775 



ATC ATG 
lie Met 

535 

GCA GOA 
Ala Gly 

TTC TAC 
Phe Tyr 

AGT AGC 
Ser Ser 



GGG GCA 
Gly Ala 
60 0 
CTG AGC 
Leu Ser 
615 

CTG GTC 
Leu Val 

CCA CTC 
Pro Leu 

TAT GAT 
Tyr Asp 

GCC ACC 
Ala Thr 
680 
GAT ATC 
Asp lie 
695 

GCA CCC 
Ala Pro 

GAG GCA 
Glu Ala 



CAC AAC CCA 

Hi s Asn Pro 

GTA TAT GCC 
Val Tyr Ala 
555 

CTC CTG CCC 
Leu Leu Pro 

570 
ACC AAT ACC 
Thr Asn Thr 
585 

CTG TTG TCC 
Leu Leu Ser 

ACT GGG GCA 
Thr Gly Ale 



ATC GTT GTG 
I le Val Val 
635 

ATT GTA TTT 
I le Val Phe 

650 
GAT GAA GGG 
Asp GLu Gly 
665 

CTG CAG AAT 
Leu Gin Asn 

AAA CCT GAG 

Lys Pro Glu 

AAC ACT CTG 
Asn Ser Val 

715 

GAT AAT GAT 
Asp Asn Asp 

730 



AAC TTC 
Asn Phe 

540 

CGA CGT 
Arg Arg 

ATT GTG 
lie Val 

CTC ACT 
Leu Thr 

TGT AAC 

Cys Asn 
605 
CTG ATC 
Leu lie 
620 

CTG TTT 
Leu Phe 



GAA GAG 

Glu Glu 

CGT GGT 
Gly Gly 



CCT GAC 
Pro Asp 
665 
TAT CAG 

Tyr Gin 
700 

GAT GTG 
Asp Val 

CCC ACA 

Pro Thr 



ACA GTA 

Thr val 

GGA GGG 

Gly Gly 

ATC AGT 
lie Ser 
575 
ATC AAA 
lie Lys 
590 

CCT GAA 
Ala Glu 

GCC ATC 
Ala He 

GTT ACC 
Vel Thr 

GAG GAT 
Glu Asp 
655 
GAG GAA 

Glu Glu 

670 

GGC ATC 
Gly He 



AGA GAC 
Arg Asp 
545 
TTC AGT 
Phe Ser 
5^ 

GAT GGT 
Asp Gly 

CTC TGT 
Val Cys 

CCC TAC 
Ala Tyr 

CTT CCC 
Leu Ala 
625 
CTG AGG 
Leu Arg 
640 

GTC CGT 
Val Arg 

GAC ACT 
Asp Thr 

AAT GGA 
Asn Gly 



TAT ATG 
Tyr Met 

GAC GAC 
Asp Asp 

GCC CCT 
Ala Pro 

735 



CCT AGA 
Pro Arg 
705 
TTC ATC 
Phe He 
720 

CCC TAT 
Pro Tyr 



AAC AGA GAT 
Asn Arg Asp 

CGG CAG AAG 
Arg Gin Lys 

CGC ATT CCA 
Gly He Pro 

GCC TGT GAT 
Gly Cys Asp 

595 
ATC CTG AAT 
He Leu Asn 
610 

TGC ATC GTC 
Cys He Val 

AGG CAA AAG 

Arg Gin Lys 

GAG AAC ATC 

Glu Asn I le 
660 

GAA GCC TTC 
Glu Ala Phe 

675 
TTT ATC CCT 
Phe He Pro 
690 

CCT GGG CTG 
Pro Gly Leu 



AAC ACA AGA 

Asn Thr Arg 

GAC TCC ATC 
Asp Ser He 

740 
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2167 



2215 



2263 
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CAA ATC TAT GGT TAT GAG GGC CGG GGT TCC GTG GCT GGG TCC CTG AGC 2551 
Gin He Tyr Gty Tyr Glu Gly Arg Gly Ser Val Ala Cly Ser Leu Ser 

745 750 755 

TCC TTG GAG TCT GCC ACG ACA GAG TCA GAC CTG GAC TAC GAC TAT CTA 2599 
Ser Leu GLu Ser Ala Thr Thr Asp Ser Asp Leu Asp Tyr Asp Tyr Leu 

760 765 770 

CAG AAC TGG GGA CCT CGT TTT AAG AAA CTG GCA GAC TTG TAT GGC TCC 2647 
Gin Asn Trp Gly Pro Arg Phe Lys Lys Leo Ala Asp Leu Tyr Gly Ser 

775 780 785 

AAA GAC ACT TTT CAT GAT GAC TCT TAACAATAAT GTTACAAATT TGGCCTTAAG 2701 
Lys Asp Thr Phe Asp Asp Asp Ser 

790 795 

AACTGTGTCT GGCATTCTCA AGAATCTAGA AGATGTGTAA ACAGGTATTT TTTTAAATCA 2761 

AGGAAAGGCT CATTTAAAAC AAGCAGAGTT TTACAGAGAG GAAACATTTA ATk,^AACTGC 2821 

AAGGACATCA AAGTGGAAAA TACTGTGAAG TACCTTTTCC CACTTAAAAA GCAAATATTG 2881 

^5 AAGTTGTTTA TCAACTTCAG TAGAAAAAAA AAAACCACTT CCCACACAAA ATATTTAAAT 2941 

GAAGGAGAAG TCCACGGTGA ACTTACAATG AAGGGAAATC GTCTATGTGT TAAGAACATC 3001 

TAAGTCTCTC TTATTTTATT TTTTAATTTG TCAAAGAAGC TTCCACAAAA TTAGAAAGGA 3061 

CAACAGTTCT GAGCTGAAAT TTCGCCTTAA ACTATGGACA CTCTATCTGT AGTCCGTTTT 3121 

TAAACTTTGA ATATATAATA TCCAGCCAGC TTAAACCCAT ACAATGTATG TACAATACAA 3181 

TGTACAATTA TGTCTCTTGA GCATCAATCT TGTTACTGCT GATTCTTGTA AATCTTTTTG 3241 

CTTCTACTTT CATCCTAAAC TAATACGTGC CAGATATAAC TGTCTTGTTT CACTGAGGAG 3301 

CACCCTATTT CTATGTCATT TTTAATGTAT CTATTTGTAC AATTTTAAAG TTCTTATTTT 3361 

AGTATACATA CAAATATCAG TATTCTGACA TGTACGAAAA TGTTACAGCA TCACACTTAT 3421 

ATTTTATGAA CATTGTACTG TTCCTTTAAT ATGAGCTTCA ATATAAGAAG CAACCTTTGA 3481 

^0 AATAAAAAAA AGATTCTTTT TTAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 3541 

AAAAAAAAAA AAAAAAAAAA AAAAAAGCGG CCGCGAATTC 3581 
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SEQ ID MO: 2 

SECXJEMCE TYPE: nucleic acid 
S£OU€«CE LENGTH: 3712 base pairs 

STRANOEDNESS: double 
^ TOf>OLOGY: linear 

MOLECULAR TYPE: cONA to »*MA 

ORIGINAL SOURCE 

ORGAN I S«: Homo sapiens 

TISSUE TYPE: osteosarcoma 

FEATURE: 

SEQUENCE Description: SEQ ID MO: 2: 

GAATTCGflAG ATCTACAGGC GAGGAGAGAT GCCGCCGGCG CCGCTCGCAG CCCCCGCTGA 60 
CTTGTGAATG GGACCGGGAC TGGGGCCGGC ACTGACACCG CAGCGCTTCC CCTGCGCCAC 120 
GGACTGGCGG CTCGGAGGTT GCGTCCACCC TCAAGGGCCC CAGAAATCAC TGTCTTTTCA 180 
GCTCAGCGGC CCTGTGACAT TCCTTCGTGT TGTCATTTGT TGAGTGACCA ATCAGATGGG 240 
TGGAGTGTGT TACAGAAATT GGCAGCAAGT ATCCAATGGG TGAAGAAGAA GCTAACTCCG 300 
GACGTGGGCA GCCCTGACGT GATGAGCTCA ACCAGCAGAG ACATTCCATC CCAAGAGAGG 360 
TCTGCGTGAC GCGTCCGGGA GGCCACCCTC AGCAAGACCA CCGTACAGTT CGTGCAAGGG 420 
GTGACAGCTG CATTCTC-Q TGCCTACCAC GTAACCAAAA ATG AAG GAG AAC TAG 47^ 

Met Lys Glu Asn Tyr 
1 5 

TGT TTA CAA GCC GCC CTG GTG TGC CTC CGC ATG CTG TGC CAC AGC CAT 523 
Cys Leu Gin Ate Ala Leu Val Cys Leo GLy Met Leu Cys His Ser His 

10 T5 20 

GCC TIT GCC CCA GAG CGG CGG GGG CAC CTG CGG CCC TCC TTC CAT GGG 571 
Ala Phe Ala Pro Gtu Arg Arg Gly His Leu Arg Pro Ser Phe His Gly 

25 30 35 

CAC CAT GAG AAG GGC AAG GAG GGG CAG GTG CTA CAC CGC TCC AAG CGT 619 
His His Glu Lys Gly Lys Gtu Gly Gin Val Leu Gin Arg Ser Lys Arg 
35 40 45 50 

GGC TGG GTC TGG AAC CAG TTC TTC GTG ATA CAG GAG TAC ACC CGG CCT 667 
Gly Trp Val Trp Asn Gin Phe Phe Val Ue Gtu Glu Tyr Thr Gly Pro 
55 60 65 

40 CAC CCC GTG CTT GTG GGC AGG CTT CAT TCA CAT ATT CAC TCT CGT GAT 715 

Asp Pro Val Leu Val Gly Arg Leu His Ser Asp Ue Asp Ser Gly A*p 
70 7S 80 85 

GGG AAC ATT AAA TAC ATT CTC TCA GGG GAA CGA GCT GGA ACC ATT TTT 763 
Asri He Lys Tyr He Leu Ser Gly GLu Gly Ala Gly Thr lie Phe 
90 95 100 

GTC ATT GAT GAC AAA TCA GGG AAC ATT CAT GCC ACC AAG ACG TTG GAT 
Val He Asp Asp Lys Ser Gly Asn He His Ala Thr Lys Thr Leu Asp 
105 110 
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CGA OAA GAG *ClA GCC CAG TAC ACG TTG ATG OCT CAG CCG GTG GAC AGG 859 
Arg Glu Glu Arg Ala GLn Tyr Thr Le^j Met Ala Gtn Ala Va I Asp Arg 

120 125 130 

GAC ACC A>^ CGG CCA CTG GAG CCA CCG TCG GAA TTC ATT GTC AAG GTC 907 
Asp Thr A&n A-g Pro L«J Glu Pro Pro S«r Glu Phe lie Va L Lys V« I 

135 UO U5 

CAG GAC ATT GAC AAC CCT CCG GAG TTC CTG CAC GAG ACC TAT CAT 955 

Cln Asp lie Asn Asp Afin Pro Pro Glu Phe Leu His Gtu Thr Tyr His 
150 ^55 160 165 

GCC AAC GTG CCT GAC AGG TCC AAT GTG CGA ACC TCA GTA ATC CAG GTG 1003 
Ala Asn Vat Pro Glu Arg S«r A«n Val Gly Thr Ser Val lie Gin Val 

170 17^ 150 

ACA GCT TCA GAT GCA GAT GAC CCC ACT TAT GGA AAT AGC GCC AAG TTA 1051 
Thr Ala S«r Asp Ala Asp Asp Pro Thr Tyr Cly Asn Ser Ala Lys Leu 

185 190 195 

GTG TAC AC" ATC CTC GAA GGA CAA CCC TAT TTT TCG GTG GAA GCA CAG 1099 
Val Tyr Ser Me Leu Glu Gly Gin Pro Tyr Phe Set Val Glu Ala Gin 

200 205 210 

ACA GGT ATC ATC AGA ACA GCC CTA CCC AAC ATG GAC AGG GAG GCC AAG 1 U7 

Thr Gly He lie Arg Thr Ala Leo Pro Asn Met Asp Arg Glu Ala Lys 

215 220 225 

GAG GAG TAC CAC GTG GTG ATC CAG GCC AAG GAC ATG GCT GGA CAT ATG 
Gtu Glu Tyr Hts Val Val lie Gin Ala Lys Asp Met Gly Gly His Met 
230 235 260 245 

GGC GGA CTC TCA GGG ACA ACC AAA GTG ACG ATC ACA CTG ACC GAT GTC 
Gly Giy Leu Ser Gly Thr Thr Lys Val Thr lie Thr Leu Thr Asp Val 

250 255 260 

AAT GAC AAC CCA CCA AAG TTT CCG CAG ACC GTA TAC CAG ATA TCT GTG 1291 
Asn Asp Asn Pro Pro Lys Phe Pro Gin Ser Val Tyr Gin lie Ser Val 

265 270 275 

TCA GAA GCA GCC GTC CCT GGG GAG GAA GTA GGA AGA GTC AAA CCT AAA 1339 
Ser Glu Ala Ala Val Pro Cly Glu Glu Val Gly Arg Val Lys Ala Lys 

2BC 285 290 

GAT CCA OAC ATT GGA GAA AAT CCC TTA GTC ACA TAC AAT ATT GTT CAT 1387 
Asp Pro Asp ite Gly Glu Asn Gly Leu Val Thr Tyr Atn He Val Asp 

295 300 305 

GCA CAT GGT ATC GAA TCG TTT CAA ATC ACA ACG CAC TAT CAA ACA CAG U35 
Gly Asp Gly Met Glu Ser Phe Glu He Thr Thr Asp Tyr Glu Thr Gin 
310 315 320 325 
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GAC GGC GTG ATA AAG CTG AAA AAG CCT CTA GAT TTT GAA ACC AAA AGA 
Glu Gly Val lie Lys Leu lys Lys Pro Val Asp Phe Glu Thr Lys Arg 

330 335 340 

CCC TAT AGC TTC AAG GTA GAG GCA GCC AAC GTG CAC ATC GAC CCG AAG 
Ala Tyr Ser Leu Lys Val Glu Alt Ala Asn Val Kis lie Asp Pro Lys 

345 350 355 

TTT ATC AGC A>T GGC CCT TTC AAG GAC ACT GTG ACC GTC AAG ATC GCA 
Phe lie Ser Asn Gly Pro Phe Lys Asp Thr Val Thr Val Lys lie Ala 

360 165 370 

GTA GAA GAT GCT GAT GAG CCC CCT ATG TTC TTC GCC CCA ACT TAC ATC 
Val Glu Asp Ala Asp GLu Pro Pro Met Phe Leu Ala Pro Ser Tyr [le 

375 380 385 

CAC GAA GTC CAA GAA AAT CCA GCT GCT GGC ACC GTG GTT GGG AGA GTG 
His Glu Val Gin Glu Asn Ala Ala Ala Gly Thr Val Val Gly Arg Val 
390 395 400 405 

CAT GCC AAA GAC CCT GAT GCT GCC AAC AGC CCG ATA AGG TAT TCC ATC 
His Ala Lys Asp Pro Asp Ala Ala Asn Ser Pro lie Arg Tyr Ser He 

410 415 420 

GAT CGT CAC ACT GAC CTC GAC AGA TTT TTC ACT ATT AAT CCA GAG GAT 
Asp Arg His Thr Asp Leu Asp Arg Phe Phe Thr He Asn Pro Glu Asp 

425 430 435 

GGT TTT ATT AAA ACT ACA AAA CCT CTG GAT AGA GAG GAA ACA GCC TGG 
Gly Phe He Lys Thr Thr Lys Pro Leu Asp Arg Glu Glu Thr Ala Trp 

440 445 450 

CTC AAC ATC ACT GTC TTT GCA GCA GAA ATC CAC AAT CCC CAT CAC GAA 
Leu Asn He Thr Val Phe Ala Ala Glu He His Asn Arg His Gin Glu 

455 460 465 

GCC AAA GTC CCA GTG GCC ATT AGG CTC CTT GAT GTC AAC CAT AAT GCT 
Ala Lys Val Pro Val Ala He Arg Val Leu Asp Val Asn Asp Asn Ala 
470 475 480 485 

CCC AAG TTT GCT GCC CCT TAT GAA CGT TTC ATC TCT GAG ACT GAT CAC 
Pro lys Phe Ala Ala Pro Tyr Glu Gly Phe He Cys Glu Ser Asp Gin 

490 495 500 

ACC AAG CCA CTT TCC AAC CAG CCA ATT CTT ACA ATT ACT GCA GAT GAC 
Thr Lys Pro Leu Ser Asn Gin Pro He Val Thr He Ser Ala Asp Asp 

505 510 515 

AAG GAT GAC ACG GCC AAT GGA CCA AGA TTT ATC TTC ACC CTA CCC CCT 
Lys Asp Asp Thr Ala Asn Gly Pro Arg Phe He Phe Ser Leu Pro Pro 
520 525 530 
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(U> ATC ATT CAC AAT CCA A>T ACA GTC AGA uAC AAC COA GAT AAC 2107 

Glu Me lie his Asn Pro Asn Phe Thr Val Arg Asp Asn Arg Asp Asn 

535 5^0 545 

ACA GCA GGC G^G TAC GCC CGG CGT GO GGG TTC ACT CGG CAG AAG CAG 2155 
Thr AU Gly Val Tyr Ala Arg A^fl Gly Gly Phe Ser Arg Gin Lys Gin 
550 555 560 565 

GAG TTC TAC CTT CTG CCC ATA GTG ATC AGC GAT GCC GCC ATC CGG CCC 2203 
Asp Le^ i.eu Leu Pro lie v«l He Ser Asp Gly Gty lie Pro Pro 

570 575 5dO 

ATG AGT AGC ACC AAC ACC CTC ACC ATC AAA GTC TCC GGG TCC GAG GTG 2251 
Met Ser Se- Thr Asn Thr Leu Thr lie Lys V«l Cy« Gly Cys Asp Val 

585 590 595 

AAC GGG GCA CTG CTC TCC TGC AAC GCA GAG GCC TAC ATT CTG AAC GCC 2299 
Asn Gly Ala Leu Leu Ser Cys Asn Ale Glu Ata Tyr Me Leu Asn Ala 

60C 605 610 

GGC CTG AGC ACA CGC GCC CTG ATC GCC ATC CTC GCC TCC ATC GTC ATT 2J47 
Gty Levj Ser Thr Gly Ala Leu He Ala Me Leu Ala Cys Me Val Me 

615 620 625 

CTC CTG GTC ATT GTA GTA TTG TTT GTG ACC CTG AGA AGC CAA AAG AAA 2395 
Leu Leu Va: Ite Vat Val Leu Phe Val Thr Leu Arg Arg Gin Lys Lys 
630 635 640 645 

GAA CCA CTC ATT dc TTT GAG GAA GAA GAT GTC CGT GAG AAC ATC ATT 2443 
Glu Pro Leu Me VaL Phe Glu Glu Clu Asp Vat Arg Glu Asn Me Me 

650 655 660 

ACT TAT GAT GAT GAA GGC GGT GGG GAA GAA CAC ACA GAA CCC TTT GAT 2491 
Thr Tyr Asp Asp Glu Gly Gly Gty Glu Glu Asp Thr Glu Ala Phe Asp 

665 670 675 

ATT GCC ACC CTC CAG AAT CCT CAT CGT ATC AAT GCA TTT ATC CCC CCC 2539 
lie Ata Thr Leu Gin Asn Pro Asp Gly Me Asn Gly Phe Me Pro Arg 
6fiO 685 690 

4Q AAA GAC ATC AAA CCT GAG TAT CAG TAC ATC CCT AGA CCT GGG CTC CGC 2587 

Lys Asp Me Lys Pro Glu Tyr Gin Tyr Het Pro Arg Pro Gly Leu Arg 

695 700 705 

CCA CCC CCC AAC AGC GTG GAT GTC OAT GAC TTC ATC AAC ACC ACA ATA 2635 
45 Al» ^""0 Asn Ser Vat Asp Val Asp Asp Phe Me Asn Thr Arg He 

710 715 720 725 

CAC CAG GCA GAC AAT CAC CCC ACC CCT CCT CCT TAT CAC TCC ATT CAA 2683 
Gin Glu Ala Asp Asn Asp Pro Thr Ala Pro Pro Tyr Asp Ser Me Gin 
5 0 73 0 73 5 740 



35 



21 



30 



35 



40 



45 



50 



EP 0 585 801 A2 

ATC TAC GOT TAT GAA GGC AGO GGC TCA GTG GCC GCG TCC CTG AGC TCC 2731 
lie Tyr Gly Tyr Glu Gly Arg Gly Ser Val Al. Gly Ser Leu Ser Ser 

745 750 

CTA GAG TCG GCC ACC ACA GAT TCA GAC TTG GAC TAT GAT TAT CTA CAG 2779 
Leu Glu ser Al. Thr Thr Asp Ser Asp Leu Asp Tyr Asp Tyr Leu Gin 

760 765 770 

AAC TGG GGA CCT CGT TTT AAG AAA CTA GCA GAT TTG TAT GCT TCC AAA 2827 
Asn Trp Gly Pro Arg Phe Lys Lys Leo Al. Asp Leu Tyr Gly Ser Lys 

775 780 785 

GAC ACT TTT GAT GAC CAT TCT TAA CAATAACGAT ACAAATTTGG CCTTAAGAAC 2881 
Asp Thr Phe Asp Asp Asp Ser 
790 795 

TGTGTCTCGC GTTCTCAAGA ATCTAGAAGA TGTGTAAACA GCTATTTTTT TAAATCAAGG 2941 

AAAGGCTCAT TTAAAACAGG CAAAGTTTTA CAGAGAGCAT ACATTTAATA AAACTGCGAG 3001 

GACATCAAAG TGGTAAATAC TGTGAAATAC CTTTTCTCAC AAAAAGGCAA ATATTGAAGT 3061 

TGTTTATCAA CTTCGCTAGA AAAAAAAAAC ACTTGGCATA CAAAATATTT AAGTGAAGGA 3121 

GAAGTCTAAC GCTGAACTGA CAATCAACGG AAATTGTTTA TCTGTTATGA ACATCCAAGT 3181 

CTTTCTTCTT TTTTAAGTTG TCAAAGAAGC TTCCACAAAA TTAGAAAGGA CAACAGTTCT 32A1 

GAGCTGTAAT TTCGCCTTAA ACTCTGGACA CTCTATATGT AGTGCATTTT TAAACTTGAA 5301 

ATATATAATA TTCAGCCAGC TTAAACCCAT ACAATCTAT6 TACAATACAA TGTACAATTA 3361 

TGTCTCTTGA GCATCAATCT TGTTACTGCT GATTCTTGTA AATCTTTTTC CTTCTACTTT 5421 

CATCTTAAAC TAATACGTGC CAGATATAAC TGTCTTGTTT CAGTGAGA6A CGCCCTATTT 5481 

CTATGTCATT TTTAATGTAT CTATTTGTAC AATTTTAAAG TTCTTATTTT AGTATACATA 3541 

TAAATATCAG TATTCTGACA TGTAAGAAAA TGTTACGCCA TCACACTTAT ATTTTATGAA 5601 

CATTGTACTG TTGCTTTAAT ATGAGCTTCA ATATAAGAAG CAATCTTT6A AATAAAAAAA 3661 

GATTTTTTTT TAAAAAAAAG GAGATCTACA GGCCTGTAGA TCTCCGAATT C 3712 
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SEC 10 HO: 3 

SEQUENCE MPE: nucleic acid 
SEQUENCE LENGTH: 39U base p«irs 

STRAMDEDNESS : dojbie 
TOPOLOGY: linear 
MOLECULAR rrPE: cONA to k«NA 

ORIGINAL SOURCE 
ORGANISM: Hoto sapiens 
TISSUE TYPE: osteosarcoma 

FEATURE : 

SEQUENCE Description: SEO ID MO: 3: 

GAATTCGGAG ATCTACAGGC CCGCGACGCT CCCCTCACCT GGCGGCGGCC GCGGAGAGAT 
GCCGCGGGCG CCGCTCGCAG CCGCCGCTGA CTTGTGAATG GGACCCGGAC TGGGGCCCGG 120 
ACTGACACCG CAGCGCTTGC CCTGCGCCAG CCACTGGCGG CTCCGAGGTT GCGTCCACCC 180 
TCAAGGGCCC CAGAAATCAC TGTGTTTTCA GCTCAGCGGC CCTGTGACAT TCCTTCGTGT 240 
TGTCATTTGT TGAGTGACCA ATCAGATGGG TCGAGTGTGT TACAGAAATT GGCAGCAAGT 300 
ATCCAATGGG TGAAGAAGAA GCTAACTCGG GACGTGGGCA GCCCTGACGT GATGAGCTCA 360 
ACCAGCAGAG ACATTCCATC CCAAGAGAGG TCTGCGTGAC GCGTCCGGGA GGCCACCCTC 420 
AGCAAGACCA CCGTACAGTT GGTGGAAGGG GTGACACCTG CATTCTCCTG TGCCTACCAC 480 
GTAACCAAAA ATG AAG GAG AAC TAC TGT TTA CAA GCC GCC CTC GIG TGC 529 
Wet Lys Glu Asn Tyr Cys Leu Gtn Ala Ala Leu Val Cys 
1 5 10 

CTG GGC ATG CTG TGC CAC AGC CAT GCC TTT GCC CCA GAG CGC CGC GGG 577 
Leu Gty Met Leu Cys nis Ser His Ala Phe Ala Pro Glu Arg Arg Gly 

15 20 25 

CAC CTG CGG CCC TCC TTC CAT GGG CAC CAT GAG AAC GGC AAG GAG GGG 625 
His Leu Arg Pro Ser Phe His Gty His His Glu Lys Gty Lys Glu Gly 
30 35 40 45 

CAG GTG CTA CAG CGC TCC AAG CGT GGC TGG GTC TGG AAC CAG TTC TTC 673 
Gin Val Leu Gtn Arg Ser Lys Arg Gly Trp Val Trp Asn Gin Phe Phe 

50 55 60 

GTG ATA GAG GAG TAC ACC GGC CCT GAC CCC GTG CTT GTG CGC ACG CTT 721 
Val He Glu Glu Tyr Thr Gly Pro Asp Pro Val Leu Val Gly Arg Leu 

65 70 75 

CAT TCA GAT ATT GAC TCT CCT GAT CGC AAC ATT AAA TAC ATT CTC TCA 769 
His Ser Asp lie Asp Ser Gly Asp Gly Asn He Lys Tyr He Leu Ser 

SO 85 90 

GGG GAA CGA GCT GGA ACC ATT TTT GTG ATT GAT GAC AAA TCA GGC AAC 817 
Gly Glu Gly Ale Gly Thr He Phe Val Me Asp Asp Lys Ser Gly Asn 
95 100 105 
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ATT CAT GCC ACC ACG TTG GAT CGA OAA GAG AGA GCC CAG TAC ACG 

He H,s Ala Thr Lys Thr Le^ Asp Aro Glu Glu Arg Ala Gtn Tyr Thr 

110 115 

TTG ATG GCT CAG GCG GTG CJkC AGG GAC ACC AAT CGC CCA CTG GAG CCA 
Leo Met Ala Gin Ala Vet Asp Arg Asp Thr Asn Arg Pro Leu Glu Pro 

150 135 HO 

CCG TCG GAA TTC ATT GTC AAG GTC CAC GAC ATT AAT GAC AAC CCT CCG 
Pro Ser Glu Phe ile Val Lys Val Gtn Asp lie A»n Asp Asn Pro Pro 

145 150 155 

GAG TTC CTG CAC GAG ACC TAT CAT GCC AAC GTC CCT GAG AGG TCC AAT 
Glu Phe Leu Mis Gtu Thr Tyr Hi. Al. Asn V.l Pro Glu Arg Ser Asn 

160 165 1^ 

GTG GGA ACG TCA GTA ATC CAG GTG ACA GCT TCA GAT GCA GAT GAC CCC 
Val GLy Thr Ser Val lie Gin Val Thr Ala Ser Asp Ala Asp Asp Pro 

175 180 185 

ACT TAT GOA AAT A6C CCC AAG TTA GTG TAC AGT ATC CTC GAA GGA CAA 
Thr Tyr Gly Asn Ser Ala Lys Leu Vet Tyr Ser tie Leu Glu Gly Gin 
190 1^^ 200 205 

CCC TAT TTT TCG GTG GAA GCA CAG ACA GGT ATC ATC AGA ACA GCC CTA 
Pro Tyr Phe Ser Val Glu Ala Gin Thr Gly He lie Arg Thr Ala Leu 

210 215 220 

CCC AAC ATG GAC AGG GAG GCC AAG GAG GAG TAC CAC GTG GTG ATC CAG 
pro Asn Met Asp Arg Glu Ala Lys Glu Glu Tyr His Val V.l He Gin 

225 230 235 

GCC AAG GAC ATG GGT GGA CAT ATG GCC GGA CTC TCA GGG ACA ACC AAA 
Ala Lys Asp Met Gly Gly His Met Gly Gly Leu Ser Gly Thr Thr Lys 

240 2^5 250 

GTG ACG ATC ACA CTG ACC GAT GTC AAT GAC AAC CCA CCA AAG TTT CCG 
val Thr He Thr Leu Thr Asp Val Asn Asp Asn Pro Pro Lys Phe Pro 

255 260 265 

CAG AGC GTA TAC CAG ATA TCT GTC TCA GAA GCA CCC GTC CCT GGG GAG 
Gin ser Val Tyr Gin Ue Ser V.l Ser Glu Al. Ala V.l Pro Gly Glu 
270 275 2»0 285 

GAA GTA GGA AGA GTG AAA CCT AAA CAT CCA GAC ATT CGA GAA AAT GGC 
Glu V.l Gly Arg Val Lys Al. Lys Asp Pro Asp Ue Gly Glu Asn Gly 

290 2^ 
TTA GTC ACA TAC AAT ATT GTT GAT GGA GAT GGT ATG GAA TCG TTT GAA 
Leu Vat Thr Tyr Asn He Val Asp Gly Asp Gly Met Glu Ser Phe Glu 
305 310 315 
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ATC AC* ACG OAC TAT UAA ACA CAG OAG GGG GTG ATA AAG CTG AAA AAG U89 
lie Thr Thr Asp Tyr Glu Thr Gin Glu GLy V«l lie Lys Leu Lys Lys 

320 325 330 

CCT GTA GAT TTT GAA ACC AAA AGA CCC TAT AGC TTC AAC GTA GAG CCA 1537 
Pro VaL Asp Phe Glu Thr Lys Arg Ala Tyr Ser Leu Lys Va( Glu Ala 

335 540 545 

CCC AAC GTG CAC ATC GAG CCC AAG TTT ATC ACC AAT GCC CCT TTC AAG 1585 
Ala Asn Val His lie Asp Pro Ly« Phe lie Ser Asn Gly Pro Phe Lys 
350 355 360 365 

CAC ACT GTG ACC GTC AAG ATC GCA CTA CAA CAT CCT CAT GAG CCC CCT 1633 
Asp Thr Val Thr Val Lys lie Ala Val Glu A»p Ala Asp Ctu Pro Pro 

370 375 5«0 

ATG TTC TTC GCC CCA ACT TAC ATC CAC GAA GTC CAA GAA AAT GCA GCT 1651 
Net Phe Leu Ala Pro Ser Tyr I Le His Glu Val Gin Glu Asn ALa Ala 

365 390 395 

GCT GGC ACC GTG GTT GGG AGA GTG CAT GCC AAA GAC CCT GAT GCT GCC 1729 
ALa GLy Thr Val Val Gly Arg Val His Ala Lys Asp Pro Asp Ala Ala 

i^OO 405 410 

AAC AGC CCG ATA AGG TAT TCC ATC GAT CCT CAC ACT CAC CTC GAC AGA 1777 
Asn Ser Pro He Arg Tyr Ser lie Asp Arg His Thr Asp Leu Asp Arg 

i15 420 425 

TTT TTC ACT ATT AAT CCA GAG GAT GGT TTT ATT AAA ACT ACA AAA CCT 1825 
Phe Phe Thr lie Asn Pro Glu Asp Gty Phe He Lys Thr Thr Lys Pro 
i30 435 440 445 

CTG GAT AGA GAG GAA ACA GCC TCG CTC AAC ATC ACT GTC TTT GCA GCA 1873 
Leu Asp Arg Glu Glu Thr Ala Trp Leu Asn lie Thr Val Phe Ala Ala 

450 455 460 

GAA ATC CAC AAT CGC CAT CAG GAA GCC AAA GTC CCA GTG GCC ATT ACG 1921 
Glu He His Asn Arg His Gin Glu Ala Lys Val Pro Val Ala lie Arg 

465 470 475 

CTC CTT GAT GTC AAC CAT AAT GCT CCC AAC TTT CCT GCC CCT TAT CAA 1969 
Vat Leu Asp Val Asn Asp Asn ALa Pro Lys Phe Ala Ala Pro Tyr Glu 

480 485 490 

GCT TTC ATC TCT GAG ACT GAT CAC ACC AAG CCA CTT TCC AAC CAG CCA 2017 
Cly Phe lie Cys Glu Ser Asp Gin Thr Lys Pro Ltu Ser Asn Gin Pro 

495 500 505 

ATT GTT ACA ATT ACT GCA GAT GAC AAG GAT GAC ACG CCC AAT GGA CCA 2065 
He Vat Thr He Ser Ala Asp Asp Lys Asp Asp Thr Ala Asn Gly Pro 
510 515 520 525 
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AGA TTT ATC TTC AGC CTA CCC CCT GAA ATC ATT CAC AAT CCA AAT TTC 
Ars Phe lie Phe Ser leu Pro Pro Glu U« He His Asn Pro Asn Phe 

530 535 540 

ACA GTC ACiA GAC AAC CGA GAT AAC ACA GCA GGC GTG TAG GCC CGG CCT 2161 
Thr Vfll Arg Asp Asn Arg Asp Asn Thr AIb Gly V»l Tyr Ala Arfl Arg 

5A5 550 555 

GGA GGG TTC AGT CGG CAG AAG CAG GAC TTG TAC CTT CTG CCC ATA GTG 2209 
Gly Gly Phe Ser Arg Gin Lys Gtn Asp Leu Tyr Leu leu Pro He Val 

560 565 570 

ATC AGC GAT GGC GGC ATC CCG CCC ATG AGT AGC ACC AAC ACC CTC ACC 2257 
tie Ser Asp Gly GLy lie Pro Pro Het S«r Ser Thr Asn Thr Leu Thr 

575 580 585 

ATC AAA GTC TGC GGC TGC GAC GTG AAC GGG GCA CTG CTC TCC TGC AAC 2305 
He Lys Val Cys Gly Cys Asp Val Asn Gly Ala Leo leu Ser Cys Asn 
590 595 600 605 

GCA GAG GCC TAC ATT CTG AAC GCC GGC CTG AGC ACA GCC GCC CTG ATC 2353 
Ala Glu Ala Tyr ILe Leu Asn Ala Gly Leu Ser Thr Gly Ala Leu He 

610 615 620 

GCC ATC CTC GCC TGC ATC GTC ATT CTC CTG GGT TGC CCA AGC TTA ATG 2401 
Ala He Leu Ala Cys He Val He Leu Leu Gly Cys Pro Ser Leu Met 

625 630 635 

GAA CCC CCC TCT CCC AGG GAA GAC ATG AGA TTC CTT TAT CTG GGC TTC 2449 
Glu Pro Pro Ser Pro Arg Glu Asp Met Arg Leu Leu Tyr Leu Gly Phe 

6A0 6^5 650 

CAG CTG ATG CTA TTT TCC TAT GTT AAA GTA AAC AGA AGA TTT TGT CTT 2497 
Gin Leu Het Leu Phe Ser Tyr Val Lys Val Asn Arg Arg Phe Cys Leu 

655 660 665 

CTG GGG GTC TTT ATA AAA CTT CCT TTC CTC TAT GTG GTG CCT ACA GAC 2545 
Leu Gly Val Phe He Lys Leu Pro Phe Leu Tyr Val Val Ala Thr Glu 
670 675 680 6fi5 

AGT CCA ACC ACA CTT ACG TCA TTG TAGTATTGTT TGTCACCCTG AGAAGGCAAA 2599 
Ser Pro Thr Thr Leu Thr Ser Leu 
690 

AGAAAGAACC ACTCATTGTC TTTGAGGAAG AAGATCTCCG TGAGAACATC ATTACTTATG 2659 
ATGATGAAGG GGGTGGGGAA GAAGACACAG AAGCCTTTGA TATTCCCACC CTCCAGAATC 2719 
CTGATCGTAT CAATGGATTT ATCCCCCCCA AAGACATCAA ACCTGAGTAT CAGTACATGC 2779 
CTAGACCTGG GCTCCGGCCA CCGCCCAACA GCGTCGATGT CGATCACTTC ATCAACACGA 2839 
GAATACAGGA GGCAGACAAT CACCCCACCG CTCCTCCTTA TGACTCCATT CAAATCTACG 2899 
GTTATGAAGG CAGGGGCTCA GTGGCCGCGT CCCTGACCTC CCTAGACTCC GCCACCACAG 2959 
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ATTCAGACTT GCiACTATGAT TATCTACAOA ACTGGGGACC TCGTTTT/kAG AAACTAGCAG 3019 

ATTTGTATGG TTCCAAAGAC ACTTTTGATG ACGATTCTTA ACAATAACGA TACAAATTTG 3079 

GCCTTAAGAA CTGTGTCTGG CGTTCTCAAG AATCTAGAAG ATCTGTAAAC AGGTATTTTT 3139 

TTAAATCAAG GAAAGGCTCA TTTAAAACAC GCAAAGTTTT ACAGAGAGGA TACATTTAAT 3199 

AAAACTGCGA GGACATCAAA GTGGTAAATA CTGTGAAATA CCTTTTCTCA CAAAAAGGCA 3259 

AATATTGAAG TTGTTTATCA ACTTCGCTAC AAAAAAAAAA CACTTCGCAT ACAAAATATT 3319 

TAAGTGAAGG AGAAGTCTAA CGCTGAACTC ACAATGAAGG CAAATTGTTT ATGTGTTATG 3379 

AACATCCAAG TCTTTCTTCT TTTTTAACTT GTCAAAGAAG CTTCCACAAA ATTAGAAAGG 3A39 

ACAACAGTTC TGAGCTGTAA TTTCGCCTTA AACTCTGGAC ACTCTATATG TAGTGCATTT 3499 

TTAAACTTGA AATATATAAT ATTCAGCCAG CTTAAACCCA TACAATGTAT GTACAATACA 3559 

ATGTACAATT ATGTCTCTTG AGCATCAATC TTGTTACTCC TCATTCTTGT AAATCTTTTT 3619 

GCTTCTACTT TCATCTTAAA CTAATACGTG CCAGATATAA CTGTCTTGTT TCAGTGAGAG 5679 

ACGCCCTATT TCTATGTCAT TTTTAATGTA TCTATTTGTA CAATTTTAAA GTTCTTATTT 3739 

TAGTATACAT ATAAATATCA GTATTCTGAC ATGTAAGAAA ATGTTACGGC ATCACACTTA 3799 

TATTTTATGA ACATTGTACT GTTGCTTTAA TATGAGCTTC AATATAAGAA CCAATCTTTG 3859 

AAATAAAAAA AGATTTTTTT TTCGGAGATC TACAGGCCTG TAGATCTCCG AATTC 3914 
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SEQ ID MO: U 

SEOOEMCE TYPE: nucleic acid 
SEQUENCE LENGTH: 21 base pairs 

STRANOEDNESS: sir>flle 
TOPOLOGY: linear 

WOLECULAR TYPE: other nucleic acid 

ORIGINAL SOURCE 
ORGAN I SH: none 
STRAIN: none 

feature: Unk.r DMA -ith seg^nce co«pl««nt.ry to SEQ 10 NO: 5. t.n«d -ATOS-V 

SEQUENCE Description: SEQ ID MO: ^: 

CTCTTGCTTG /U^TTCGGACT A 21 



SEO ID NO: 5 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 25 base pairs 

STRANDEDNESS: single 
TOPOLOGY: linear 

HOLECULAR TYPE: other nucleic acid 

ORIGINAL SOURCE 
ORGANISM: none 
STRAIN: none 

FEATURE: linker ONA -ith sec^>ence c«,^le«nt.ry to SEQ 10 NO: 4, t.n«d "ATOS-Z" 

SEQUENCE Description: SEO ID NO: 5: 

TAGTCCGAAT TCAAGCAAGA GCACA 25 



SEQ ID NO: 6 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 21 base pairs 

STRANDEDNESS: single 
TOPOLOGY: Linear 

MOLECULAR TYPE: other nucleic acid 

ORIGINAL SOURCE 
ORGANISM: none 
STRAIN: none 

• ^ -#«*rttiirv to SEQ 10 NO: 7, tertned "ATOS-4" 

FEATURE: linker DNA with sequence carp lenient a ry to itw 

SEQUENCE Description: SEQ ID NO: 6: 
CTCTTGCTTA AGCTTGGACT A 21 



40 SEQ ID MO: 7 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 25 base pairs 

STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid 

ORIGINAL SOURCE 
ORGANISM: none 
STRAIN: none 

feature: Unk.r ONA with c»^l««nt.ry to SEQ .0 HO: 6. termed -ATOS-5" 

SEQUENCE Description: SEO ID NO: 7: 
50 TACTCCAAGC TTAAGCAAGA GCACA 25 
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SEQ ID NO; 8 

SeOUEMCE TYPE: «nir>o acid 
SEQUENCE LENGTH: 15 wnino acids 

TOPOLOGY: lir>ear 
MOLECULAR TYPE: peptide 

ORIGINAL SOURCE 
ORGANISM: Hus muscuius 

FEATURE: OSF-4.1 (antigen peptide) 
LOCATION: 

segment of nouse OSF-4 from the 101st to the 115th amino acid residue 
SEQUENCE Description: SEO ID MO: 6: 

Phe Val lie Asp Asp Lys Ser Gly Asn lie His Ala Thr Lys Thr 
5 10 15 
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1. A protein comprising mOSF-4 having an amino acid se^quence at the 25th to 796th positions in 
Sequence ID No. 1 of the Sequence Listing, hOSF-4-1 having an ammo acid sequence at the 25th to 
796th positions in Sequence ID No. 2 of the Sequence Table, or hOSF-4-2 having an ammo acid 
sequence at the 25th to 693rd positions in Sequence ID No. 3 of the Sequence Listing; or an analogue 
of the mOSF-4. hOSF-4-1 or hOSF-4-2: or a fragment of the mOSF-4, hOSF-4-1 or hOSF-4-2. 

2. A protein comprising mOSF-4 precursor protein having an ammo acid sequence at the 1st to 796th 
positions, including a signal peptide at the 1st to 24th positions, in Sequence ID No. 1 of the Sequence 
Listing, hOSF-4-1 precursor protein having an ammo acid sequence at the 1st to 796th positions, 
including a signal peptide at the 1st to 24th positions, in Sequence ID No. 2 of the Sequence Listing, or 
hOSF-4-2 precursor protein having an ammo acid sequence at the 1st to 693rd positions, including a 
signal peptide at the 1st to 24th positions, m Sequence ID No. 3 of the Sequence Listing; or an 
analogue of each precursor protein; or a fragment of each precursor protein. 

3. DNA or RNA coding for the protein of Claim 1 or 2. 

4. DNA or RNA hybridizing under stingent coditions with DNA or RNA according to claim 3. 



5. A process for the production of a recombinant mammalian OSF-4 protein according to claim 1 , or an 
analogue thereof, or a fragment thereof, comprising the steps of: 

•^^ (a) obtaining a population of cells containing a heterogeneous DNA composed of the following DNA 

sequences: 

(i) a sequence which can function in the cells to control transcription and translation, and 

(ii) a DNA sequence joined downstream of said controlling sequence to code for said recombinant 
protein, and 

(b) cultunng said population of cells under conditions which permit the production of said recom- 
binant protein. 

6. The process of Claim 5, wherein the controlling sequence further contains a DNA coding for a signal 
peptide for secreting said recombinant protein extracellularly such that said DNA is positioned 

50 immediately upstream of said DNA sequence coding for said recombinant protein, 

7. The process of Claim 5 or 6. wherein the population of cells is Escherichia cdi. or yeast, or mammalian 

cells. 

55 8. A diagnostic reagent for bone metabolic diseases, containing the whole or a fragment of the DNA or 
RNA of Claim 3 or 4. 

9. A diagnostic reagent for bone metabolic diseases, containing the protein of Claim 1. 
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10. A polyclonal oi .monoclonal antibody against the protein of Clai-m 1 

11. A diagnostic reagent for bone rr^etaboUc diseases, containing the antibody of Clam 
5 12. A therapeutic agent for bone metabolic diseases, containing the protein of Clairr. 1 
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1 2 3 /, 5 6 7 8 9 10 1112 13 

* • • ♦ • • n OSF-/, 

^^^^•^^^^m^^^ \ p-actin 

1 THYMUS 

2 SPLEEN 

3 BRAIN 

L KIDNEY 

5 LIVER 

6 LUNG 

7 TESTIS 

8 HEART 

9 OSTEOBLAST- ENRICHED CELL FROM MOUSE CALVARIA 

10 MC3T3-E1 CELLS FROM 3 DAYS CULTURE 

11 MC3T3-E1 CELLS FROM 12 DAYS CULTURE 

12 MC3T3-E1 CELLS FROM 60 DAYS CULTURE 

13 NIH3T3 CELLS 
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Table 5: Aggregation Assay of the cloned cell lines expressing mouse OSF-4 



No. of cell clots after 1 hr (N1)/ 
No. of cells pre-stirring (NO) 

Ceil lines — ■ — 

with 1 mM Ca^^ without 1 nnM Ca^ ^ 



TC-treated 

L-cell (Control) I.OOCI) 1.00 

CI 0.65 1.00 

C7 0.29 1.00 

C11 0.51 1.00 

mock(*2) 1.00 1.00 
TE-treated 

C7 1.00 1.00 



( * 1 ) The smaller the number of cell clots after one-hour stirring, the stronger 
cell aggregation occured. The number 1.00 means no aggregation. 
(•2) "mock" refers to the control cell into which the vector alone was 

introduced. 



OSF-4 provided by the present invention can be used as an agent for treating bone metabolic diseases, 
and Its high organ specificity for bones enables its use as a diagnostic reagent for bone metabolic diseases. 
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O A bone-related protein named OSF-4 which is 
obtained from bone tissue of a mammal including 
mouse or human, and a process for its production. 
This protein is a novel naturally occurring mammal 
protein of the cadherin family. 

OSF-4 acts as an adhesion molecule or a growth 
factor which takes part m the process of 
osteogenesis at the site of bone induction. OSF-4 
can be used as an agent for treating bone metabolic 
diseases, and its high organ specificity for bones 
enables its use as a diagnostic reagent for bone 
metabolic diseases. 
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